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Synthetic bio-molecular spiders with "legs" made of single-stranded segments of DNA can move 
on a surface covered by single-stranded segments of DNA called substrates when the substrate DNA 
is complementary to the leg DNA. If the motion of a spider does not affect the substrates, the spider 
behaves asymptotically as a random walk. We study the diffusion coefficient and the number of 
visited sites for spiders moving on the square lattice with a substrate in each lattice site. The spider's 
legs hop to nearest-neighbor sites with the constraint that the distance between any two legs cannot 
exceed a maximal span. We establish analytic results for bipedal spiders, and investigate multi- 
leg spiders numerically. In experimental realizations legs usually convert substrates into products 
(visited sites). The binding of legs to products is weaker, so the hopping rate from the substrates is 
smaller. This makes the problem non-Markovian and we investigate it numerically. We demonstrate 
the emergence of a counter-intuitive behavior — the more spiders are slowed down on unvisited sites, 
the more motile they become. 

PACS numbers: 87.15Vv, 02.50.Ey, 05.40.-a, 82.39.Fk 



I. INTRODUCTION 

Recent advances in DNA nano-fabrication technology 
(see [IH!]) have led to the constructions of multi-pedal 
walking molecular devices. The first was a bipedal ob- 
ject [3] walking on a one-dimensional path by DNA set 
strands with nucleic acid domains complementary to 
molecular imprints on the device legs and the substrate. 
Since then, several other similar bipedal DNA walkers 
have been synthesized (see and a review [5]). 

A different molecular design has been implemented in 
0[TD]. The resulting objects, known as molecular spi- 
ders, usually have many legs. Each leg (a short single 
strand of DNA) can bind to the substrate through the 
Watson-Crick base pair formation. A bound leg can ei- 
ther detach from the substrate without modifying it, or 
it can catalyze the cleavage of the substrate creating two 
product strands. The lower product remains bound to 
the surface, while the upper product is free to float away 
in solution. There is a residual binding of the leg to 
the remaining product, but it is weaker than the leg- 
substrate binding. Utilizing the effect of the spider's 
motion on the molecular tracks it is possible to design 
environments where molecular spiders demonstrate some 
basic robotic behaviors [10], 

When a spider is released on a surface coated with 
oligonucleotide substrates, it can cleave thousands of sub- 
strates before eventually detaching. The small size of 
spiders makes experimental observation of their motion 
very challenging. Atomic force microscopy imaging and 
single-molecule fluorescence studies have been successful 
to a certain degree [TU] , yet neither the details of the spi- 
der's gait nor the individual paths of spiders have been 
resolved with sufficient certainty. Perspectives and chal- 
lenges of the experimental work are surveyed in [11] . 

A number of modeling studies describing the motion 
of molecular spiders have been recently carried out. The 
motion of a single spider on a one-dimensional track 



has been investigated in Refs. [T31Q3]. The first article 
[T2"] ignores the difference between the substrate and the 
product and makes a number of simplifying assumptions 
about the gait of the spiders. There are no limitations on 
the number of legs, however. The chief result of Ref. [T2] 
is that the spider (which is a complicated self-interacting 
multi-leg object) can be replaced by a particle character- 
ized by a single number, the diffusion coefficient; for the 
simplest gaits, the diffusion coefficient was analytically 
computed. In [T3] we mainly considered bipedal spiders, 
but took into account that spiders affect the substrate 
(turn it into products). Despite the non-Markovian na- 
ture of the problem, the coarse-grained behavior turned 
out to be surprisingly simple, namely the difference in 
residence times on the substrate and the product leads 
to the effective bias into the unvisited region. 

Recent papers [T1HT7] utilized more detailed and com- 
plicated models mimicking the gait of spiders moving 
on one-dimensional tracks, the possibility of the detach- 
ment, etc. These studies numerically confirm the ten- 
dency of spiders to move into the unvisited region leaving 
behind the trail of the product. This key feature was ob- 
served experimentally (HHH! and proved theoretically [T3] 
in the realm of simple models. An interesting new fea- 
ture noticed in |16) is the emergence of a super-diffusive 
growth of the mean-square displacement, (x 2 (t)) ~ t a 
with 1 < a < 2, which holds on a surprisingly long time 
span; eventually, the super-diffusive growth crosses over 
to the diffusive growth. Several rigorous results concern- 
ing the asymptotic behaviors (limit theorems, transience, 
recurrence, and rate of escape) of molecular spiders have 
been established in pSEE]. In Refs. [UlEl] the motion 
of spiders in random environments has been studied. 

As in our previous work [121 113] , throughout this pa- 
per we will assume an idealized gait — the goal is not 
to mimic the complicated (and poorly known) gait of 
molecular spiders, but to qualitatively understand spi- 
ders' macroscopic characteristics in the realm of simple 



2 




FIG. 1: A molecular spider with four legs moving on the 
square lattice. The distances between any two legs should 
not exceed a threshold value (the maximal span S). As long 
as this constraint is obeyed, each leg can hop to the empty 
(not occupied by another leg) nearest neighboring site. As 
long as the rate of leg attachment greatly exceeds the rate of 
leg detachment, all legs are attached most of the time and for 
relatively short time intervals one of the leg is detached as 
illustrated above. 



models [35]. Previous theoretical analyses [T^HTT] have 
been focused on the motion of molecular spiders on one- 
dimensional tracks, while the goal of this work is to study 
a single spider moving on a two-dimensional lattice. If 
not stated otherwise, we tacitly assume that the rate of 
attachment greatly exceeds the rate of detachment. In 
this case, the relative time when one leg is detached (this 
situation is illustrated on Fig. [T]) is negligible and hence 
the possibility that two or more legs are detached simul- 
taneously can be disregarded. 

The following properties of molecular spiders will al- 
ways be assumed: 

• Hopping: When a leg detaches, it re-attaches to 
a neighboring site. 

• Exclusion: Two legs cannot be attached to the 
same site. 

• Constraint: The distance between any two legs 
does not exceed a certain maximal span. 

The restriction to the nearest neighbor hopping can be 
relaxed (long-distance gaits have been probed in the one- 
dimensional setting [12]); the exclusion is of course the 
fundamental feature. The last property is the simplest 
constraint that assures the compactness of the spider. 
For one-dimensional spiders several types of constraint 
were considered, and the diffusion coefficients of these 
spiders were exactly obtained in most cases, due to map- 
pings to exclusion processes |12) . 

The two-dimensional case is particularly important in 
current experiments. The actual situation is rather com- 
plicated, e.g., in some experiments there are a few layers 
of the substrate and hence a quasi two-dimensional set- 
ting seems more appropriate; additionally, the substrates 
do not form a perfect square lattice. Nevertheless, we 
shall assume that a spider with aforementioned simple 
gait is placed on a square lattice [53] . We emphasize that 
the constraint regarding the separations between the legs 
roughly describes real molecular spiders. One realiza- 
tion of the constraint which is convenient for the numer- 
ical implementation defines the distance between points 



(xi,yi) and (2:2,2/2) via naax(|xi - x 2 \, \yi - y 2 \)- The 
neighborhood in this metric is geometrically a square, 
and it is often called the von Neumann neighborhood. 
For example, the simplest "von Neumann" spider is the 
bipedal spider with legs separated by distance S = 1 at 
most. There are four possible configurations: horizontal, 
vertical, and two diagonal 
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where • represents a leg, and o an empty site. We found 
that the diffusion coefficient 24 of this spider is equal to 
1/4. Generally von Neumann spiders are more amenable 
to analysis, and we study them as well as more realistic 
Euclidean spiders. 

The rest of this paper is organized as follows. Sec- 



tion [n] is devoted to von Neumann spiders. In Sec. Ill 
we present main results for bipedal Euclidean spiders; 
the detailed derivations are given in the Appendix. An 
analysis of molecular spiders simplifies when the maximal 
span increases and in Sec. |IV| we describe corresponding 
asymptotic behaviors. In the following sections we relax 
some of the assumptions about the spider gait, the influ- 
ence of the spider's motion on the environment, etc. In 
Sec. [V] we consider the influence of memory. We model 
the difference between the product and the substrate by 
postulating that the leg spends (on average) more time at 
newly visited sites, i.e., on the substrates. This slowdown 
in comparison with the motion on the products leads to 
faster (covering more unvisited sites) spiders; the reason 
for this phenomenon is an effective bias towards unvisited 
sites. This behavior has been observed and explained in 
one dimension |13j . and it continues to hold in two di- 
mensions. In Sec. |VI| we investigate what happens when 
the attachment rate is finite. We compute the mean time 
the spider spends on the surface and show that the prob- 
ability to remain attached decays exponentially if the at- 
tachment rate greatly exceeds the detachment rate. We 



summarize our findings in Sec. VII 



II. VON NEUMANN SPIDERS 

A spider is quantified by lattice points = (xj,2/«) 
with i = 1, . . . , L which describe the positions of its legs. 
We assume that the spider has maximal span S, so the 
distance between any two legs is < S. Each leg, when al- 
lowed, hops to neighboring sites (up, down, left, or right) 
at rate 1 in each direction. In this section we use the 
metric which assigns the distance between any two legs 
Yi and Yj according to the rule 



r, - r, 



= max 



(\xi-Xjl\yi-yj\) 



In this metric, the neighborhood around the origin, i.e. 
the disk |r|oo < S, is the square in Euclidean metric [see 
the example in Q]; such a neighborhood is often called 
the von Neumann neighborhood. 
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A spider with maximal span 5 between the legs should 
therefore occupy the square with (5 + l) 2 lattice sites; 
after shifting, this square becomes 



{{x, y) ■■ x = 0, . . . , 5; y = 0, . . . , 5} 



(2) 



This spider can therefore have at most (5+1) 2 legs. Only 
spiders with 



L < 5(5+1) 



(3) 



legs are mobile. More precisely, spiders with more legs 
than the above upper limit, 5(5+1) < L < (5+1) 2 , have 
a few legs which can move, but each such spider forever 
remains within its surrounding square (|2j), provided that 
we ignore multiple legs being detached simultaneously. 

Let us first calculate the total number of possible con- 
figurations of the legs. To avoid multiple counting of 
configurations which are obtained by translation, we use 
the convention that there must be a leg both in the bot- 
tom row {y — 0) and in the leftmost column (x = 0), 
as in the example in ([I]). With this convention the total 
number of configurations is 



e(L,5) 



(5 + 1) 2 
L 



JSiS, 



(4) 



To establish Q let us ignore for a moment the afore- 
mentioned convention restricting the position of one leg. 
The number of such unrestricted configurations of a spi- 
der with L legs inside the m x n rectangular is equal to 
Sl(m,n) = (™ n ). To obtain 6(L, 5) we take the num- 
ber of configurations without restriction fi(5 +1,5 + 1), 
subtract the number of configurations without a leg in 
the bottom row f2(5 + 1,5) or in the leftmost column 
f2(5, 5+1), and then add the number of configurations 
without a leg in both the bottom row and the leftmost 
column f2(5, 5), since we subtracted it twice. This leads 
to @. 

We are interested in the properties of the center of 
mass of the legs 



R 



1 L 

3 = 1 



(5) 



Due to the symmetry of the hopping rules, the mean 
position of the spider does not change (R) = 0, and we 
arc interested in the variance (R 2 ) of the center of mass. 



A. Bipedal spiders 

Here we show that the bipedal spider with arbitrary 
maximal span 5 has the diffusion coefficient 



(6) 



The bipedal spider with 5=1 corresponds to the bipedal 
spider with I — \/2 in the Euclidean version, and from 
^ we indeed recover the already known result D = 1/4. 



To derive ^ we start by noticing that bipedal spi- 
ders are "completely symmetric" : In each configuration 
the spider hops at the same rate in each direction. For 
completely symmetric spiders on the square lattice, a re- 
markably simple formula 

(R 2 ) = ^=4^, with D=^- 2 (7) 

for the mean-square of the center of mass, and respec- 
tively for the diffusion coefficient of the spider, was estab- 
lished in [T2] . In Eq. Q we denote by us the total rate the 
spider hops which is averaged over all stationary states. 
In general, for spiders with any number of legs and with 
symmetric hopping rates, the transitions between any 
two connected configurations occur at the same rates and 
hence all configurations have the same stationary prob- 
ability 1/6. Therefore the total rate which is averaged 
over all stationary states, in short the stationary average 
rate, is equal to 



1 C 



46 



(8) 



where uij is the total hopping rate from configuration j. 
Consequently, Q can be rewritten as 



D 



e 

— y 



(9) 



(Note that for the symmetric random walk on the square 
lattice L = 1 and ui = 4, hence the diffusion coefficient is 
one.) 

Let us first calculate the average hoping rate to a given 
arbitrary direction, say to the right. Notice that in most 
configurations both legs can jump to the right, but in 
some, only one of them. It is easier to enumerate the 
number of these later configurations, which is where one 
leg is blocked and cannot hop to the right. There is only 
one configuration where the two legs are next to each 
other in the same row ••, due to the convention that a 
leg is needed both in the bottom row and the leftmost 
column. In this configuration only the right leg can hop 
to the right. There are 5+1 configurations where one 
leg is in the bottom left corner (0, 0), and the other is at 
maximal distance x = 5, hence only the left leg can hop 
to the right. There are further 5 configurations where 
the right leg cannot hop: one leg is in the bottom right 
corner (5, 0), and the other leg is in the leftmost column, 
and in row y = 1, . . . , 5. Hence all together there are 
1 + (5 + 1) + 5 = 2(5 + 1) configurations where only one 
leg can hop to the right, and 6 — 2(5+1) where both can. 
Now using the fact that the average rate is the same for 
all four directions, equation ^ leads to 



2(5 + 1) 



2(5+1)] 1 
w= 6^ = 2 ~5 (10) 

where we have taken into account that the number of 
configurations for bipedal spiders is 6 = 25(5 + 1), as it 
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follows from Q. Substituting (10 1 into Q we arrive at 
the diffusion coefficient 



B. Multipedal spiders 

It is much more challenging to compute the diffusion 
coefficient for multi-leg spiders. The chief reason is that 
spiders with more than two legs are not completely sym- 
metric and hence one cannot use Q. To show the lack 
of symmetry it suffices to provide a configuration where 
the left and right hopping rates of the spider are differ- 
ent. Consider for example a tripod with two legs being in 
the same column and the third leg being at the maximal 
distance S in the x direction from both of the other legs, 
as illustrated here 



S 



(11) 



This spider can hop to the right at rate two and to the 
left at rate one. (More precisely, both legs on the left 
can hop to the right, but not to the left; the leg on the 
right can hop to the left, but not to the right.) Similar 
configurations can be easily constructed for any spiders 
with L > 3 legs. 

The matrix method described in Ref. [12] can be gen- 
eralized to arbitrary dimension and in principle it allows 
one to analytically determine the diffusion coefficient for 
any spiders with sufficiently small number of legs. How- 
ever, even in the simplest examples the exact calculations 
are rather cumbersome. For instance, even for the sim- 
plest S — 2 tripod on the square lattice the number of 
configurations is equal to 48 [see Eq.Q], so the compu- 
tation of D leads to the necessity to diagonalize a 48 x 48 
matrix; obtaining this matrix is very laborious. 

We performed simulations for the simplest spiders to 
measure their diffusion coefficients. This quantity is rel- 
atively easy to measure by probing the asymptotic of the 
mean-square displacement. It turns out that the cor- 
rection to the true asymptotic decays is rather small, 
namely, 



(R 2 



ADt 1 



(12) 



for multi-pedal spiders. (Similar corrections were ob- 
served in one dimension.) Note that there are no cor- 
rections at all for continuous time random walks (the 
mono-pedal spiders) . The measured values of D are sum- 
marized in Table [U 

To detect the motion of a single spider is still experi- 
mentally impossible, while various techniques allow one 
to count the total number of visited sites [9j . Since nu- 
merous spiders are usually released [9] , dividing the actu- 
ally observed total number of visited sites by the number 
of spiders makes the mean number of sites visited by a 
spider accessible. In the limit when the density of spiders 
is very low, the same result will emerge if we take a single 
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TABLE I: Simulation results for the diffusion coefficients D 
for spiders with L legs and constraint S. For the random walk 
(L = 1) the diffusion coefficient is D = 1; for bipedal spiders 
(L = 2) we have exact results given by ©. 
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TABLE II: Simulation results for the amplitude A in the 
asymptotic law for the number of visited sites (N) = At/ Int. 
Spiders have L legs and constraint S. For the random walk 
L = 1 and the amplitude is A = 4tt. 



spider and average the number of different visited sites 
over many realizations. 

For a symmetric random walk which hops at rate one 
in each direction (so its diffusion coefficient is D = 1), 
the mean number of different sites visited by the random 
walk scales as [251 



(N) = 



At 
In Bt 







t 



(hit) 3 



(13) 



in the large time limit. The amplitudes are A — Air and 
B = 32 cxp(C E - 1) = 20.97 . . . where C E = 0.577215 . . . 
denotes Euler's constant. (In Ref. [25], the random walk 
with diffusion coefficient D = 1/4 has been analyzed; to 
recast the prediction of [25] to our setting, where D = 1, 
we rescaled time by a factor 4.) 

We simulated the motion of different spiders and we 
obtained the same asymptotic behavior (13), with A and 
B depending on the number of legs L, and the constraint 
S. We are mainly interested in the leading order behavior 
A, however, fitting also the next correction B is unavoid- 
able in order to get an estimate for A, due to the large 
sub-leading corrections. The results for the coefficient A 
are summarized in Table [TTJ 

One can see from Tables |I| and \H\ t hat , according to sim- 
ulations, D(L, S) and A(L, S) are monotonically decreas- 
ing functions of L, and monotonically increasing func- 
tions of 5* for multi-pedal spiders. The simplest conjec- 
ture is that asymptotically the spider is indistinguishable 
from the random walk on the square lattice. Mathemati- 
cally, this would imply that A = AnD. Simulation results 
for the diffusion coefficient show that A is slightly differ- 
ent than that. 
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III. EUCLIDEAN SPIDERS 

In this section we consider the more realistic Euclidean 
spiders. Thus we use the standard Euclidean metric to 
measure the distance between the legs: 

|r, - Tj\ = {xi - Xj ) 2 + ( Vl - Vj ) 2 

The simplest Euclidean spider is bipedal with legs sepa- 
rated by maximal distance £ = v2. This spider is iden- 
tical to the simplest L = 2, S = 1 von Neumann spider; 
it is characterized by four configurations 0, and it has 
diffusion coefficient D = 1/4. 

For the bipedal spider with maximal separation £ = 2, 
there are six different configurations — four configura- 
tions (JlJ and two additional configurations, the horizon- 
tal configuration • o • and its vertical cousin. The com- 
putation of the diffusion coefficient gives D = 1/4 [see 
Eq. (Al) in the Appendix], so this spider has the same 
diffusion coefficient as the previous one. This surprising 
result is a coincidence rather than a rule. We computed 
diffusion coefficients for bipedal spiders with many other 
maximal distances. For I = y5, four new configurations 
(with legs separated by the move of a knight in chess) 
appear. Next change occurs for £ — 2y2 when two new 
"long" diagonal configurations arise. Then for £ = 3, 
the horizontal configuration • o o • and analogous verti- 
cal configuration become possible. Varying £ up to -\/50, 
the total number of distinct allowed configurations 6 and 
the diffusion coefficient exhibit the behaviors summarized 
in Table HE 

The diffusion coefficient tends to increase with £, yet 
its behavior is somewhat erratic and it can occasionally 

Hprrpflqp <r 2 19 <-• A 17/29 49_ ^ 4 33 ^ 65 

UCUC<BC U4 ^ 8' 48 ^ 22' 40 ^ 68' 112 ^ 9' 74 ^ 144' 

etc.). The last column reveals remarkable hidden regu- 
larities — the quantity (1/2— D) |6| is always half-integer 
with equilibrium patches of increasing length punctuated 
by upward jumps by 1/2. These intriguing observations 
are explained by the neat general formula 



D 



(14) 



where [£\ is the integer part of I. 

The derivation of ( 14 ) is somewhat lengthy (see the 
Appendix), but it is just an application of general for- 
mula ([9]) for the diffusion coefficient. The dependence of 
C and rates on I is non-trivial and cannot be deduced 
analytically for an arbitrary £. Indeed, the problem of 
counting the total number of configurations is equivalent 
to the problem of computing Ji^ which gives the total 
number of lattice sites within the disk of radius £; more 
precisely, C = (3\f^ — l)/2. The investigation of con- 
stitutes the celebrated Gauss problem [27] • Of course, 
'Ni approximately grows as the area, 'Ne ss tt£ 2 ; the devi- 
ation from this dominant growth law is extremely diffi- 
cult to probe analytically. (The sub-leading asymptotic 
is unknown. More precisely, the proven upper bound on 
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TABLE III: Total number of configurations and diffusion co- 
efficients for Euclidean spiders. 



the growth of the sub-leading asymptotic is substantially 
weaker than the conjectural one; proving the conjectural 
asymptotic is known to be equivalent to proving the Rie- 
mann conjecture.) This subtlety is irrelevant as long as 
we are satisfied with the leading asymptotic. In our prob- 
lem the case of large £ is particularly simple as both legs 
essentially diffuse independently and therefore the diffu- 
sion coefficient of the center of mass is very close to the 
half of the diffusion coefficient of the random walk. Thus 
D e -t 1/2 when £ -> 00. Note that D < 1/2 for any 
two-leg spider. Indeed, since ojj < 8 (the equality oc- 
curs when each leg is allowed to hop to each of the four 
neighbors) , the sum on the right-hand side of ^ cannot 
exceed 8x6 which proves D < 1/2. 

The derivation of ( 14 1 and a detailed description of the 
configurations for the threshold up to £ = are given 
in the Appendix. 



IV. SPIDERS WITH LARGE MAXIMAL SPAN 

Interactions between legs (due to exclusion and the 
maximal span constraint) imply that molecular spiders 
are complicated self-interacting objects. Therefore it is 
very difficult to compute the dependence of the diffu- 
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sion coefficient D(L, 5) of a spicier on the number of legs 
L and the maximal span 5. The behavior of D(L, 5) 
simplifies when the maximal span 5 becomes large. In 
the 5 — y oo limit, typical separations between legs grow 
with time thereby making exclusion asymptotically neg- 
ligible. Therefore in this limit we can treat legs as non- 
interacting random walkers. The diffusion coefficient of 
such spider is D(L 7 5 = oo) = 1/L. In this section we de- 
rive this result and then argue that the finite 5 correction 
has the 1/5 form. 



A. Non-interacting legs 

Consider a spider with L non-interacting legs. Each leg 
performs a random walk with hopping rates one in each 
direction. The mean and mean-square displacement for 
each leg read 



0, 



4f, 



1. 



(15) 



where we have assumed that initially all legs are at the 
origin. (We continue to assume that the spider moves 
on the square lattice; generally on the cubic lattice in d 
dimensions the amplitude on the right-hand side of ( 15 ) 
is given by 2d.) Using ( [l5| one computes the variance of 
the center of mass ([5} of the spider 



<R 2 ) = L 



L 
-2 \ 



^ N r|) = L~ 1 4t 



implying that the diffusion coefficient D(L) of the spider 
with L non-interacting legs (each performing the random 
walk with diffusion coefficient D = 1) is 



D(L) = 



1 



(16) 



This result is remarkably universal: It is valid in any 
dimension and it also does not depend on the lattice, the 
only requirement is the absence of bias. 



B. Interacting Legs 

We now return to our original spiders which move on 
a lattice and obey two rules: (i) Two legs cannot be 
attached to the same site; and (ii) the distance between 
any two legs does not exceed 5. In the 5* — > oo limit 
the legs are asymptotically non-interacting. Therefore 
Eq. (16} implies that D(L, S = oo) = 1/L. 

The non-trivial task is to compute the leading correc- 
tion which describes the deviation from ( 16 ) in the situ- 



ation when the maximal span 5 is large, but finite. To 
guess the 5 dependence of the leading correction let us 
look at known exact results for spiders moving on the one- 
dimensional lattice. For the bipedal spider the diffusion 
coefficient is given by [12] 



D(2,S) 



1 



1 - 



1 



(17) 



More generally in one dimension for multi-leg spiders 
with nearest-neighbor hopping and the constraint on the 
maximal span (spiders with global constraint in termi- 
nology of Ref . [TJ] ) , the diffusion coefficient reads [T^] 



D(L,S) = 



1 - 



L - 1 



(18) 



This formula is valid for L 
all L > 2,5 > L- 1. 



1 (the random walk) and 



Equations (17)-(18) show that at least in one dimen- 
sion the leading correction has the 5 _1 dependence on 
the maximal span. Wc now demonstrate the universality 
of this behavior by analyzing the bipedal spider in one 
dimension with a more complicated gate. Namely, let us 
assume two types of hops: the nearest-neighbor ±1 hops 
occur with rate p each, and the next-nearest-neighbor 
±2 hops occur with rate q each. For a random walk with 
such gate, the diffusion coefficient is D = p + 4g, and 
since we always normalize this diffusion coefficient of the 
random walk to unity, we set 1 = p + 4q. Using methods 
of Ref. [T2] , one obtains 



D(2,S;p) = 



1 



1 - 



P 



S 



(19) 



showing that the 5 _1 behavior is universal, namely it is 
insensitive to the details of the gate. 

In two dimensions all known results also agree with the 
5 _1 behavior of the leading correction [28]. The exact 
diffusion coefficient for the von Neumann bipedal spider 
^ is in perfect agreement (no other corrections). For 
the Euclidean bipedal spider the behavior is more com- 



plicated [see ( 14 )] ; the asymptotic behavior is very sim- 
ple, 



£>(2, oo) - D(2,S) 



1 

ttS 



(20) 



and it agrees with the 5 _1 asymptotic. Thus the exact 
results (17H19), and the asymptotic (20) suggest 
the following conjectural asymptotic behavior 



D(L, oo) - D(L, 5) 



A d (L) 
5 



5 — > oo 



(21) 



Intuitively, the 1/5 correction stems from the ratio of 
spider-leg configurations where the constraint is relevant 
(the distance of two or more legs is 5) to the total number 
of such configurations. The total number of configura- 
tions is essentially a volume of a domain of characteristic 
size 5, while the number of configurations for which the 
constraint is relevant is the surface area of that domain, 
so the ratio is indeed proportional to 1/5. 

We have presented evidence in favor of the conjectural 
behavior (21) in one and two dimensions, but it is prob- 



ably valid in arbitrary dimension d. The limiting diffu- 
sion coefficient D(L, oo) = 1/L is universal, while the 
amplitude Ad(L) in the sub- leading term depends not 
only on the number of legs, but also on the spatial di- 
mension d, and on the details of the gate. In all known 
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examples the amplitude Ad(L) is positive, e.g. accord- 



ing to (18) the amplitude is A\(L) = 1 — L in one 



dimension. The positivity of A^L) is physically evident 
(the constraint on the maximal span makes spiders less 
motile), although it is not clear how to prove this positiv- 
ity. Finally we note that simulation results for diffusion 
coefficients presented in Table |T] are in surprisingly good 
agreement with the conjectural asymptotic (21 1, namely 
the quantity S[D(L, oo) — D(L, S)} already changes very 
little when S increases from 2 to 3. 



V. NON-MARKOVIAN EFFECTS 

In this section we continue to assume that the re- 
attachment of a leg is instantaneous. In contrast to 
Sees, [n] and |III| however, we take into account the ef- 
fects of memory associated with previous visits of the 
legs. These effects are unavoidable in most experimental 
realizations — the first time a leg visits a site (an un- 
cleaved substrate), the leg hops from this site only after 
it has cleaved it into a product. (The product is un- 
affected by future visits, so we only need to know if the 
site has been visited in the past or not.) Thus the motion 
of the synthetic molecular motor, the molecular spider, 
irreversibly changes the environment making the prob- 
lem non-Markovian. Intriguingly, one natural molecular 
motor, a special protein called collagenase which moves 
along collagen fibrils, exhibits even stronger irreversible 
effect on its one-dimensional track and undergoes a bi- 
ased diffusion [5UH52] . 

Our main interest is the leading order behavior of two 
quantities: the mean square position (R 2 ) ~ ADt, i.e. 
the diffusion coefficient, and the mean number of visited 
sites (N) ~ At I Int. We start with the one-leg spider, 
the random walk, where the effect of memory is asymp- 
totically negligible, yet the corrections to the leading be- 
haviors are qualitatively similar to those which arise for 
multi-legs spiders. 

The results reported in this section are mostly numer- 
ical. In two dimensions, all simulations are made for von 
Neumann spiders. The convergence to the true asymp- 
totic behaviors is slow, especially in two dimensions 
where it is logarithmically slow. Therefore to extract ac- 
curate numerical predictions for the diffusion coefficient 
D and the amplitude A we need to know functional forms 
of correction terms. We make use [in Eqs. ( 22 )— ( 24 )] of 



correction terms which provide good fit to the data. The 
functional forms of these leading correction terms are still 
lacking theoretical justification. 



A. Random walk, L = 1 

We consider the random walk which hops symmetri- 
cally to nearest-neighbor sites and changes substrates 
into products. As in Ref. [13], the change of the sub- 
strates to the products (which occurs after the leg first 



visits the site) is modeled by postulating that the hopping 
rate to each neighbor is equal to 1 for the site which has 
not been visited before (the substrate), and to r for the 
site which has been visited in the past. Thus the hopping 
rate is determined by the state of the issuing site, but not 
by the state of the target site. Mathematically, the pa- 
rameter r can be an arbitrary positive number, r > 0; in 
experiments, the detachment from the product is easier, 
so < r < 1 as we shall assume in the following. 

According to simulations, the memory has no effect 
on a random walk in the leading order. Independently 
of the value of r, we obtained D = 1 for the diffusion 
coefficient, and A — Air for the amplitude. The same 
universality was numerically observed in one dimension 
[13] ; for the amplitude A, the lack of dependence on r 
was established analytically [15] . 

To extract the leading order behavior from the sim- 
ulation data one has to investigate higher order correc- 
tions as well. Our numerical simulations indicate that the 
mean square position has the following correction terms 
in the presence of memory 



(R 



2Dt [1 
ADt [1 - 



■ai(r)/Vt\ 
a 2 (r)/\nt] 



d 

d- 



(22) 



while there are no corrections at all in the absence of 
memory r = 1, that is, 0^(1) = 0. The large corrections 
in two dimensions are especially important in determin- 
ing the diffusion coefficient. Fitting our data to (22) wc 
got D = 1 both in one and two dimensions. The ampli- 
tudes of the correction terms are positive, a d (r) > when 
< r < 1, and monotonically decreasing functions of r. 
Therefore the memory slows down the random walk in 
the next to leading order. This behavior is understand- 
able, as the random walk slows down at newly visited 
sites. 

For the average number of sites (N) visited during the 
time interval (0, t), the corrections to the leading asymp- 
totic are also more important (since they vanish much 
more slowly with time) in two dimensions. According to 
our simulations, 



( >M ^M*[l- C2 (r)/lnt] 



lnt 



d = l 
d = 2 



(23) 



The correction amplitudes are again positive, cjj) > 
when Q < r < 1, and monotonically decreasing functions 
of r. Fitting our data to (23) we found that the am- 



plitudes A d(r) do not depend on r for the random walk. 
(Equation ( 23 1 also describes the average number of sites 
visited by a spider, and in that situation the amplitudes 
Ad{r) do depend onr.) Note that in two dimensions we 
found the same type of asymptotic behavior both with 
and without memory, see Eq. (13), and for an arbitrary 
number of legs. 



0.45 





FIG. 2: Diffusion coefficient D describing the asymptotic be- 
havior of the mean square position of a spider (R 2 ) ~ ADt as 
a function of the memory parameter r. Simulation data are 
shown for the simplest bipedal spider with S = 1, and for the 
simplest tripod with S — 2. The no memory case corresponds 
to r = 1. 



FIG. 3: Amplitude A describing the asymptotic behavior of 
the number of visited sites as (N) ~ At / In t as a function 
of the memory parameter r. Simulation data are shown for 
the simplest bipedal spider with S = 1, and for the simplest 
tripod with 5 = 2. 



B. Multi-pedal molecular spiders, L > 1 

In order to investigate the effects of memory we nu- 
merically studied the two simplest spiders: the bipedal 
L = 2, S = 1 spider, and the tripod L = 3, S = 2. 

We have seen in Sec. [Tl] that when memory effects 
are ignored, the corrections to the diffusion coefficient 
D quickly decay with time (12). This feature makes D 
easy to measure. In the presence of memory, however, 
a much slower decaying correction term appears in two 
dimensions 



(R 2 > 



2Di(r)t [l-ai(r)M] 
4D 2 (r)t l-a 2 (r)/(lnt) 2 l 



d = l 

d = 2 



(24) 



In one dimension, the correction also decays slower with 
time in the presence of memory, yet it remains algebraic; 
the qualitative behavior is similar to the random walk 
with memory [see (22)]. The (ln<) -2 type correction ob- 
served in two dimensions is unusual; it is rather slow of 
course, yet it is faster than for the random walk. 



Using (|24b and ( 23 ) we extract the diffusion coefficient 



D(r) and the amplitude A(r); their dependences on r 
are displayed [29] in Figs. [2] and [3] Surprisingly, the 
slowdown of legs at new sites leads to larger diffusion 
coefficient and increases the average number of visited 
sites in the large time limit. The qualitative reason is 
that the slow leg at new sites keeps the other legs close 
to newly visited sites, which generates an effective bias 
toward new sites, and thereby it leads to the increase 
in the number of visited sites. This effect has already 
been observed, and quantitatively understood, for one- 
dimensional spiders [13]. Although here we talk about 
the asymptotic behavior, the actual average number of 



visited sites also becomes larger for smaller values of r 
after about a few hundred time units. 

An interesting new feature of two-dimensional spiders 
is that the tripod is somewhat more sensitive to the slow- 
down at new sites than the bipedal spider, as D changes 
more rapidly with changing r (see Fig. [2]) . Another re- 
mark is that, as one can see in Figs. [2J & [3j with large 
enough slowdown (r J$ 0.6) the tripod becomes faster 
than the bipedal spider without memory (r = 1). 

An intriguing feature of these two-dimensional spiders 
is that for sufficiently small r tripods become more motile 
than bipedal spiders, see in Fig. [2j A possible reason for 
this effect is that the spider with more legs stick to the 
domain of new sites more efficiently. 



VI. UNBINDING OF SPIDERS 

All our previous analyses have relied on the assump- 
tion that the re-attachment of a leg is instantaneous, so 
the process is controlled by detachment. This implies 
that spiders remain fully attached and never leave the 
surface. If the re-attachment takes time, the problem be- 
comes more complicated but not necessarily intractable 
— for molecular motors |33j . for instance, some analyses 
allow complete detachment (unbinding) from cytoskele- 
tal filaments 34 40J. 

In this section we treat a more restricted problem, 
namely we compute the probability that a spider remains 
attached. We disregard memory and lattice effects and 
focus on the attachment-detachment process. 

Consider a spider with L legs. Let H n (t) be the prob- 
ability that n of its legs arc attached. To simplify no- 
tation we set the detachment rate equal to unity; we 
shall disregard the difference between the substrate and 
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the product. We denote the attachment rate by A. The 
probabilities II n (£) with n = 1, . . . ,L obey 



dt 



= (n + l)II„ + i - nll„ 

- A(L-n)n„ + A(L-n + l)n„_i (25) 



where the terms on the right-hand side in the top line 
account for detachment and the terms in the bottom line 
describe re-attachment. Equation (25) remains applica- 
ble to extreme probabilities 111 and IT^ if we set 



n (i) = o, n L+ i(t) = o 



(26) 



The latter relation is obvious (by definition, the spider 
has L legs); the former relation is actually the assumption 
that if all legs are detached, the spider leaves the surface 
and never re-attaches to it. The initial condition is 



n„(o) - s n<L 



(27) 



if we imagine that the spider is initially fully attached. 
An analysis of the initial-boundary value problem 



(25|— (27| is rather straightforward, so we only present 



one asymptotic result which is valid in the most interest- 
ing limit of quick re- attachment, A > 1. In this limit we 
found that, for any L, the probability H(t) — Xm=i n n (t) 
that the spider remains attached at time t is exponential: 



II = exp(-L</A 



L-l\ 



(28) 



Equation (281 shows that the mean time for the spider 
to detach is 



(tl 



,L-1 



(29) 



in the A 1 limit. 

If we were only interested in the mean detachment 
time, we could have determined it exactly without com- 
puting the probabilities H n (t). Indeed, utilizing an exact 
solution for the adsorption time for the so-called one-step 
process [41] . one finds an exact expression 



(t L ) 



L 

E- 

a=l 



E 



(L-b)i(b-iy. 

' (a - b)\(L - a + b)\ 



(30) 



which is a polynomial in A. Keeping only the dominant 
a = 1 term we see that (30) reduces to (29). Displaying 



a few more terms we obtain 



{tl 



L 



L(L-l) 



+ A 



L-3 



L - 1 



L—l L(L — l)(£-2) 



The above analysis is "zero-dimensional" as we ignored 
the lattice. For instance, consider a tripod in the config- 
uration • • • and imagine that one of its legs detaches. If 
an extreme leg detaches, its re-attachment rate is clearly 



larger than the re-attachment rate of the middle leg. In 
equations (25) this feature is ignored. Therefore combin- 



ing detachment with diffusion makes the problem very 
complicated even in the absence of memory effects. A 
numerical analysis of models which take into account 
the possibility of the detachment has been undertaken in 
Refs. [UJ[r5]. For the bipedal molecular spider, the one- 
dimensional version of this problem could be tractable as 
long as the memory effects are ignored. 



VII. SUMMARY 

We investigated the motion of a single molecular spi- 
der on the square lattice. The limit when the motion 
of the spider does not affect the environment is tractable 
for bipedal spiders, while spiders with more than two legs 
remain very challenging for analytical work. For bipedal 
spiders we computed the diffusion coefficient with an ar- 
bitrary maximal span between the legs. Generally, the 
increase in the maximal span leads to the increase in the 
diffusion coefficient. This phenomenon strictly holds for 
one type of bipedal spiders (von Neumann) , while for Eu- 
clidean bipedal spiders an increase of the span can some- 
times, unexpectedly, decrease the diffusion coefficient. 

We explored the behavior of von Neumann spiders with 
more than two legs by means of numerical simulations. In 
general we found that the increase of the number of legs 
makes spiders less motile. The increase of the maximal 
span S (maximal allowed distance between the legs), on 
the other hand, makes spiders more motile. In the infinite 
span limit the diffusion coefficient is reciprocal to the 
number of legs, D(L,S — oo) = l/L. We argued that 
the leading large S correction to the diffusion coefficient 
is proportional to 1/5. We also considered the effect of 
spiders completely unbinding from the substrate, and we 
found that the time it takes grows exponentially with the 
number of legs. The reason is that a spider unbinds only 
if all of its legs simultaneously unbind. 

In experimental realizations, the legs usually convert 
substrates into shorter products that have a lower affinity 
for the legs. Assuming that the substrate turns into the 
product after the first visit of a leg, we investigated this 
non-Markovian problem numerically. We showed that 
the long-time behavior is diffusive in character. More 
precisely, we demonstrated that the mean-square dis- 
placement grows as t and the total number of distinct 
visited sites grows as t/lnt. The amplitudes are affected 
by memory. Furthermore, the non-Markovian nature of 
the problem leads to very large sub-leading corrections. 
For instance, the relative magnitude of the sub-leading 
correction to the mean-square displacement decays with 
time in a very slow (lni) -2 manner. 

The most surprising influence of memory is that the 
more spiders are slowed down on unvisited sites, the 
more motile spiders become. An explanation of this very 
counter-intuitive behavior is that the "stickiness" to un- 
visited sites generates an effective bias toward unvisited 
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sites, which results in the increase of the visited area. 
For example, although without memory an L = 3, S = 2 
spider is slower than an L = 2, S = 1 spider, with the 
memory effect the three leg spider can visit more sites on 
average. 



Surprisingly, the diffusion coefficient is the same as for 
the previous two-leg spider. 



3. y/5<£<2y/2 
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Appendix A: EUCLIDEAN BIPEDAL SPIDERS 



Four more configurations become possible: C = 10. 
The rates of these new knight configurations (Fig. [4]) are 
^knight = 4. The rates of the previous configurations are 



->(••) = w(« o •) 



6 



(A2) 



while the diagonal configurations reach the maximal pos- 
sible rate Wdiag = 8. Thus 

D = — — [6x4+8x2 + 4x4] = - 
16 10 L J 20 



For small t, we can count various configurations and 
compute their rates by hand. Here we record what hap- 
pens when £ varies up to V50. Configurations that arise 
when £ varies up to \/20 arc depicted below. 
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FIG. 4: 


Non-isomorphic configurations for 


£ = 


^20. 


Conn 



urations share the center site (filled disk) while another leg 
is in a labeled disk. The label counts the 'bifurcation' event 
when such configurations first appear (the first bifurcation is 
identified with £ = v2). For instance, the simplest knight 
configurations appear in the third bifurcation event (when £ 
passes through y/b) and hence the corresponding label is 3. 



1. ^2 < £ < 2 

There are four different configurations. The rates are 
u)\ = LU2 = ^3 = W4 = 4 and hence ^ gives D = 1/4. 



4. 2^2<^<3 

Two new diagonal configurations (Fig.JZJ) of length 2\/2 
become possible; overall 6 = 12. The rates remain the 
same [Eq. ( |A2| ], for horizontal and vertical configura- 
tions of both kinds, and for short diagonal configurations 
(wdiag = 8). The rates increase for knight configurations: 
^knight = 6. Finally for the new diagonal configurations 

,(2) 
diag 



(Fig. m the rates are = 4. Thus 

11 3 

D = — — [6x4 + 8x2 + 6x4 + 4x2] = - 
16 12 8 

Note that as the number of configurations from 6 to 10 
to 12, the diffusion coefficient also gets larger. 



5. 3 < £ < VW 

The rates of horizontal configurations are 

w(»») = 6, w(»o») = 8, u(»oot) = 2 (A3) 
and similarly for the vertical once. The other rates are 

Wdiag = 8, CJ d / g = 4, CJfcnight = 6 (A4) 

The diffusion coefficient D = 5/14 is smaller than the 
diffusion coefficient D = 3/8 that characterizes spiders 
with 2^2 < £ < 3. 



2. 2 < £ < 



10 < £ < vT3 



Two additional configurations are • o • and its vertical 
cousin, so that 6 = 6. We have 

= 6, U)(» o •) = 2, w di a g = 8 

and therefore 



D = ii[6 + 2 + 4]x2 = i 



Four new knight configurations obtained by hopping 
one lattice spacing in one direction and three lattice 
spacings in orthogonal direction arise. The rates which 
change compared to rates (A3)-(A4) are 



w(« o o •) = 6, weight = 8, w^ ght = 4 



( A1 ) The diffusion coefficient is D = 7/18. 



11 



I 


1 


2 


3 


4 


5 


6 


7 


(1,1) 


(2,2) 


(3,3) 


(4,4) 


5 


6 


8 


8 


8 


2 








8 


8 


8 





\/26 


6 


8 


8 


8 


6 








8 


8 


8 





^29 


6 


8 


8 


8 


6 








8 


8 


8 





\/32 


6 


8 


8 


8 


6 








8 


8 


8 


4 


%/M 


6 


8 


8 


8 


6 








8 


8 


8 


4 


6 


6 


8 


8 


8 


8 


2 





8 


8 


8 


4 




6 


8 


8 


8 


8 


6 





8 


8 


8 


4 




6 


8 


8 


8 


8 


6 





8 


8 


8 


4 




6 


8 


8 


8 


8 


6 





8 


8 


8 


8 


\/45 


6 


8 


8 


8 


8 


6 





8 


8 


8 


8 


7 


6 


8 


8 


8 


8 


8 


2 


8 


8 


8 


8 



TABLE IV: The rates of the linear and diagonal configura- 
tions for Euclidean spiders in the 5 < I < 7 window. 



7. Vl3 < f. < 4 

Four new knight configurations obtained by hopping 
two lattice spacings in one direction and three lattice 
spacings in orthogonal direction arise. The rates which 
change are 



,(2) _ 



M = 
knight 



j (3) - 

knight 



The diffusion coefficient is D — 9/22. 



8. Large £ 



We now give some explanations. The behavior of the 
quantity $ = (1/2 - D) G (see Table III I is easy to un- 
derstand. First, using Eq. Q we can re- write $ as 



2 16 ' 



3=1 



(A5) 



Now let us examine the increment of $ that occurs when 
new configurations are born. A direct counting gives 



(AC, A3l)diag = (2, 16) 
(AC, Aft) Unear = (2,8) 
(AC, ABJknigM = (4, 32) 
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TABLE V: The rates of the knight configurations for Eu- 
clidean spiders in the 5 < I < 7 window. 



We now outline following bifurcations. We classify 
various configurations into linear, diagonal, and knight. 
Each linear configuration is either horizontal or vertical. 
There are also two kinds of diagonal configurations while 
knight configurations have four different types. The rates 
of linear configurations and diagonal configurations [by 
(d, d) we denote the diagonal configuration of length dy/2] 
change with their size. Some of these rates are collected 
in Table IV (we present results in the window 5 < I < 7; 



overall we analyzed i < 11). 

In Table |V| we summarize how the rates of the knight 
configurations (nm denotes the knight configuration ob- 
tained by hopping n lattice spacings in one direction and 
to lattice spacings in another) vary with I. 



Plugging (A6b)-(A6c) into (A5) we find that 



(A$) liE 



1 

2 ' 



(A<f) c 



(A$) knight = (A7) 



Thus $ increases by 1/2 when a pair of linear configura- 
tions are born and does not change when configurations 
of other types are added. This explains why $ is half- 
integer and why the jumps in <3> occur when I passes the 
integer value. Hence $ = (\£\ + l)/2, where \_£\ is the 
integer part of £, that is, the largest integer not exceed- 
ing £. Plugging this into D — 1/2 — $/C we arrive at 



Eq. (14) 
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